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ABSTRACT 

It is now well established that galaxies are biased tracers of the distribution 
of matter, although it is still not known what form this bias takes. In local 
bias models the propensity for a galaxy to form at a point depends only on the 
overall density of matter at that point. Hierarchical scaling arguments allow 
one to build a fully-specified model of the underlying distribution of matter 
and to explore the effects of local bias in the regime of strong clustering. Using 
a generating- function method developed by Bernardeau & Schaeffer (1992), 
we show that hierarchical models lead one directly to the conclusion that a 
local bias does not alter the shape of the galaxy correlation function relative 
to the matter correlation function on large scales. This provides an elegant 
extension of a result first obtained by Coles (1993) for Gaussian underlying 
fields and confirms the conclusions of Scherrer & Weinberg (1998) obtained 
using a different approach. We also argue that particularly dense regions 
in a hierarchical density field display a form of bias that is different from 
that obtained by selecting such peaks in Gaussian fields: they arc themselves 
hierarchically distributed with scaling parameters Sp — p^~'^^ . This kind of bias 
is also factorizable, thus in principle furnishing a simple test of this class of 
models. 

Subject headings: galaxies: statistics - large scale structure of the Universe 
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1. Introduction 

The biggest stumbling-block for attempts to confront theories of cosmological structure 
formation with observations of galaxy clustering is the uncertain and possibly biased 
relationship between galaxies and the distribution of gravitating matter. The idea that 
galaxy formation might be biased goes back to the reahzation by Kaiser (1984) that the 
reason Abell clusters display stronger correlations than galaxies at a given separation is that 
these objects are selected to be particularly dense concentrations of matter. As such, they 
are very rare events, occurring in the tail of the distribution function of density fluctuations. 
Under such conditions a "high-peak" bias prevails: rare high peaks are much more strongly 
clustered than more typical fluctuations (Bardeen et al. 1986). If the properties of a galaxy 
(its morphology, color, luminosity) are influenced by the density of its parent halo, for 
example, then differently-selected galaxies are expected to a different bias (e.g. Dekel & 
Rees 1987). Observations show that different kinds of galaxy do cluster in different ways 
(e.g. Loveday et al. 1995; Hermit et al. 1996). 

In local bias models, the propensity of a galaxy to form at a point where the total 
(local) density of matter is p is taken to be some function /(p) (Coles 1993, hereafter 
C93; Fry & Gaztanaga 1993, hereafter FG93). It is possible to place stringent constraints 
on the effect this kind of bias can have on galaxy clustering statistics without making 
any particular assumption about the form of /. In this Letter, we describe the results 
of a different approach to local bias models that exploits new results from the theory of 
hierarchical clustering in order to place stronger constraints on what a local bias can do to 
galaxy clustering. We leave the technical details to Munshi et al. (1999a,b) and Bernardeau 
& Schaeffer (1999); here we shall simply motivate and present the results and explain their 
importance in a wider context. 
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2. Hierarchical Clustering 



The fact that Newtonian gravity is scale-free suggests that the A^-point correlation 
functions of self-gravitating particles, ^j^, evolved into the large-fluctuation regime by the 
action of gravity, should obey a scaling relation of the form 



when the elements of a structure are scaled by a factor A (e.g. Balian & Schaeffer 1989). 
Observations offer some support for such an idea, in that the observed two-point correlation 
function ^ (r) of galaxies is reasonably well represented by a power law over a large range of 
length scales. 



(Groth & Peebles 1977; Davis & Peebles 1977) for r between, say, 100/i-^kpc and 
10h~^ Mpc. The observed three point function, ^3, is well-established to have a hierarchical 
form 



where ^ab — C{'^a,^b), etc, and Q is a constant (Davis & Peebles 1977; Groth & Peebles 
1977). The four-point correlation function can be expressed as a combination of graphs with 
two different topologies - "snake" and "star" - with corresponding (constant) amplitudes 
Ra and Rf, respectively: 

^4(xa, Xfe, Xc, Xd) = RaiUb^bc^cd + • • • (12 tcrms)] + Rbi^abUc^ad + • • • (4 tcrms)] (4) 

(e.g. Fry & Peebles 1978; Fry 1984). 

It is natural to guess that all p-point correlation functions can be expressed as a sum 
over all possible p-tree graphs with (in general) different amplitudes Qp^a for each tree 
diagram topology a. If it is further assumed that there is no dependence of these amplitudes 



ep(Ari,...Ar,)=A-^(^-%(ri,...r^) 



(1) 




(2) 




(3) 
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upon the shape of the diagram, rather than its topology, the correlation functions should 
obey the following relation: 

(p-i) 

a, p— trees labellings edges 

To go further it is necessary to find a way of calculating Qp. One possibility, which appears 
remarkably successful when compared with numerical experiments (Munshi et al. 1999b; 
Bernardeau & Schaeffer 1999), is to calculate the amphtude for a given graph by simply 
assigning a weight to each vertex of the diagram where n is the order of the vertex (the 
number of lines that come out of it), regardless of the topology of the diagram in which it 
occurs. In this case 

<5p,a = n ^n- (6) 

vertices 

Averages of higher-order correlation functions can be defined as 

^p-^l---IU^^---'^p^dV,...dVp. (7) 

Higher-order statistical properties of galaxy counts are often described in terms of the 
scaling parameters Sp constructed from the via 

Sp - A- (8) 

It is a consequence of the particular class of hierarchical clustering models defined by 
equations (5) & (6) that all the Sp should be constant, independent of scale. 



3. Local Bias 

Using a generating function technique, originally developed by Bernardeau & Schaeffer 
(1992), it is possible to derive a series expansion for the m-point count probability 
distribution function of the objects Pm{Ni, ....N^) (the joint probability of finding 
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objects in the i-th cell, where i runs from 1 to m) from the Vn- The hierarchical model 
outlined above is therefore statistically complete. In principle, therefore, any statistical 
property of the evolved distribution of matter can be calculated just as it can for a Gaussian 
random field. This allows us to extend various results concerning the effects of biasing on 
the initial conditions into the nonlinear regime in a more elegant way than is possible using 
other approaches to hierarchical clustering. 

For example, let us consider the joint occupation probability P2(-^i, -^2) for two cells to 
contain A^i and N2 particles respectively. Using the generating-function approach outlined 
above, it is quite easy to show that, at lowest order, 

P2{Ni, N2) = Pi{N,)P^{N2) + Pi{N,)b{N,)P,{N2)b{N2)^u{ru), (9) 

where the Pi{Ni) are the individual count probabilities of each volume separately and ^12 
is the underlying mass correlation function. The function b{Ni) we have introduced in (9) 
depends on the set of Vn appearing in equation (6); its precise form does not matter in this 
context, but the structure of equation (9) is very useful. We can use (9) to define 

where C,NiN2{f^i2) is the cross-correlation of "cells" of occupancy A^i and N2 respectively. 
Prom this definition and equation (9) it follows that 

W.(r) = 6(7Vi)6(7V2)62(r); (11) 
we have dropped the subscripts on r for clarity from now on. ^From (11) we can obtain 

for the special case where Ni = N2 = N which can be identified with the usual definition 
of the bias parameter associated with the correlations among a given set of objects 
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^obj{f) — ^objCmass('^)- Moreover, note that at this order (which is vahd on large scales), 
the correlation bias defined by equation (11) factorizes into contributions bN^ from each 
individual cell (Bernardeau 1996; Munshi et al. 1999b). 

Coles (1993) proved, under weak conditions on the form of a local bias f{p) as discussed 
in the introduction, that the large-scale biased correlation function would generally have 
a leading order term proportional to ^12(^12). In other words, one cannot change the 
large-scale slope of the correlation function of locally-biased galaxies with respect to that 
of the mass. This "theorem" was proved for bias applied to Gaussian fluctuations only and 
therefore does not obviously apply to galaxy clustering, since even on large scales deviations 
from Gaussian behaviour are significant. It also has a more minor loophole, which is that 
for certain peculiar forms of / the leading order term is proportional to ^fg; which falls off 
more sharply than ^12 on large scales. 

Steps towards the plugging of this gap began with FG93 who used an expansion of / 
in powers of 5 and weakly non-linear (perturbative) calculations of ^12(1^) to explore the 
statistical consequences of biasing in more realistic (i.e. non-Gaussian) fields. Based largely 
on these arguments, Scherrer & Weinberg (1998), hereafter SW98, confirmed the vahdity of 
the C93 result in the non-linear regime, and also showed explicitly that non-linear evolution 
always guarantees the existence of a linear leading-order term regardless of /, thus plugging 
the small gap in the original C93 argument. These works have a similar motivation to 
ours, and also exploit hierarchical scaling arguments of the type discussed in §2 en route 
to their conclusions. What is different about the approach we have used in this paper is 
that the somewhat cumbersome simultaneous expansion of / and ^12 used by SW98 is not 
required in our calculation: we use the generating functions to proceed directly to the joint 
probability (9), while SW98 have to perform a complicated sum over moments of a bivariate 
distribution. The factorization of the probability distribution (9) is also a stronger result 
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than that presented by SW98, in that it leads almost trivially to the C93 "theorem" but 
also generalizes to higher-order correlations than the two-point case under discussion here. 

Note that the density of a cell of given volume is simply proportional to its occupation 
number N. The factorizability of the dependence of ^NiN-zi'f^u) upon b{Ni) and b{N2) in 
(11) means that applying a local bias /(p) boils down to applying some bias function 
F{N) — f[b{N)] to each cell. Integrating over all N thus leads directly to the same 
conclusion as C93, i.e. that the large-scale ^(r) of locally-biased objects is proportional to 
the underlying matter correlation function. This has also been confirmed by numerically 
using A/"-body experiments (Mann et al. 1998; Narayanan et al. 1998). 

4. Halo Bias 

In hierarchical models, galaxy formation involves the following three stages: 

1. the formation of a dark matter halo; 

2. the settling of gas into the halo potential; 

3. the cooling and fragmentation of this gas into stars. 

Rather than attempting to model these stages in one go by a simple function / of the 
underlying density field it is interesting to see how each of these selections might influence 
the resulting statistical properties. Bardeen et al. (1986), inspired by Kaiser (1984), 
pioneered this approach by calculating detailed statistical properties of high-density regions 
in Gaussian fluctuations flelds. Mo & White (1996) and Mo et al. (1997) went further 
along this road by using an extension of the Press-Schechter (1974) theory to calculate the 
correlation bias of halos, thus making an attempt to correct for the dynamical evolution 
absent in the Bardeen et al. approach. The extended Press-Schechter approach seem to be 
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in good agreement with numerical simulations, except for small halo masses (Jing 1998). It 
forms the basis of many models for halo bias in the subsequent literature (e.g. Moscardini 
et al. 1998; Tegmark & Peebles 1998). 

The hierarchical models furnish an elegant extension of this work that incorporates 
both density-selection and non-linear dynamics in an alternative to the Mo & White 
(1996) approach. We exploit the properties of equation (H) to construct the correlation 
function of volumes where the occupation number exceeds some critical value. For very 
high occupations these volumes should be in good correspondence with collapsed objects. 

The way of proceeding is to construct a tree graph for all the points in both volumes. 
One then has to re-partition the elements of this graph into internal lines (representing the 
correlations within each cell) and external lines (representing inter-cell correlations). Using 
this approach the distribution of high-density regions in a field whose correlations are given 
by eq. (5) can be shown to be itself described by a hierarchical model, but one in which 
the vertex weights, say M„, are different from the underlying weights z/„ (Bernardeau & 
Schaeffer 1992, 1999; Munshi et al. 1999a,b). 

First note that a density threshold is in fact a form of local bias, so the effects of 
halo bias are governed by the same strictures as described in the previous section. Many 
of the other statistical properties of the distribution of dense regions can be reduced to a 
dependence on a scaling parameter x, where 

X = N/N^. (13) 

In this definition Nc = N^2, where N is the mean number of objects in the cell and ^2 is 
defined by eq. (^ with p = 2. The scaling parameters Sp can be calculated as functions 
of X, but are generally rather messy (Munshi et al. 1999a). The most interesting limit 
when X ^ 1 is, however, rather simple. This is because the vertex weights describing the 
distribution of halos depend only on the z/„ and this dependence cancels in the ratio (0). In 
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this regime, 

Sp = (14) 

for all possible hierarchical models. The reader is referred to Munshi et al. (1999a) for 
details. This result is also obtained in the corresponding limit for very massive halos by 
Mo et al. (1997). The agreement between these two very different calculations supports 
the inference that this is a robust prediction for the bias inherent in dense regions of a 
distribution of objects undergoing gravity-driven hierarchical clustering. 



5. Discussion and Conclusions 

The main purpose of this Letter has been to advertise the importance of recent 
developments in the theory of gravitational-driven hierarchical clustering. The model 
described in equations (5) & (6) provides a statistically-complete prescription for a density 
field that has undergone hierarchical clustering. This allows us to improve considerably 
upon biasing arguments based on an underlying Gaussian field. 

These methods allow a simpler proof of the result obtained by SW98 that strong 
non-linear evolution does not invalidate the local bias theorem of C93. They also imply 
that the effect of bias on a hierarchical density field is factorizable. A special case of this 
is the bias induced by selecting regions above a density threshold. The separability of bias 
predicted in this kind of model could be put to the test if a population of objects could 
be found whose observed characteristics (luminosity, morphology, etc.) were known to 
be in one-to-one correspondence with the halo mass. Likewise, the generic prediction of 
higher-order correlation behaviour described by the behaviour of Sp in equation can also 
be used to construct a test of this particular form of bias. 

Referring to the three stages of galaxy formation described in §4, analytic theory has 
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now developed to the point where it is fairly convincing on (1) the formation of halos. 
Numerical experiments are beginning now to handle (2) the behaviour of the gas component 
(Blanton et al. 1998, 1999). But it is unlikely that much will be learned about (3) by 
theoretical arguments in the near future as the physics involved is poorly understood 
(though see Benson et al. 1999). Arguments have already been advanced to suggest that 
bias might not be a deterministic function of p, perhaps because of stochastic or other 
hidden effects (Dekel & Lahav 1998; Tegmark & Bromley 1999). It also remains possible 
that large-scale non-local bias might be induced by environmental effects (Babul & White 
1991; Bower et al. 1993). 

Before adopting these more complex models, however, it is important to exclude the 
simplest ones, or at least deal with that part of the bias that is attributable to known 
physics. At this stage this means that the 'minimal' bias model should be that based on 
the selection of dark matter halos. Establishing the extent to which observed galaxy biases 
can be explained in this minimal way is clearly an important task. 

ALM acknowledges the support of the NSF-EPSCoR program, and DM acknowledges 
the receipt of an Alexander von Humboldt research fellowship. We thank Bob Scherrer for 
interesting discussions; Francis Bernardaeau and Richard Schaeffer for helpful comments; 
and the referee, David Weinberg, for extremely helpful criticisms of an earlier version of 
this paper. 
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